# Multi-speaker support

Kartoffel Orpheus 3B German Natural V0.1
A German text-to-speech (TTS) model based on Orpheus-3B, primarily fine-tuned on natural human speech recordings, aiming to achieve realistic sound.
Speech Synthesis Transformers Supports Multiple Languages
K
SebastianBodza
551
7
Kartoffel Orpheus 3B German Synthetic V0.1
A German text-to-speech (TTS) model based on Orpheus-3B, supporting multiple speakers and emotional expression.
Speech Synthesis Transformers Supports Multiple Languages
K
SebastianBodza
147
6
Sauerkrauttts Preview 0.1 Q4 K M GGUF
SauerkrautTTS-Preview-0.1 is a German text-to-speech (TTS) model fine-tuned from a powerful base model, offering four unique German speaker voices.
Speech Synthesis German
S
VAGOsolutions
300
4
Kokoro 82M V1.1 Zh
Apache-2.0
Kokoro is an open-weight series of small yet powerful text-to-speech (TTS) models, now featuring data from 100 Chinese speakers sourced from professional datasets.
Speech Synthesis
K
hexgrad
51.56k
112
Yarngpt
Apache-2.0
YarnGPT is a text-to-speech (TTS) model specifically designed for synthesizing Nigerian-accented English, utilizing pure language modeling technology to deliver high-quality, natural, and culturally relevant speech synthesis for diverse applications.
Speech Synthesis Transformers English
Y
saheedniyi
124
34
Hindi Text To Speech Tts
MIT
Hindi text-to-speech model fine-tuned based on microsoft/speecht5_tts
Speech Synthesis Transformers
H
ShigrafS
23
0
E2 TTS
F5-TTS is a fully non-autoregressive zero-shot text-to-speech model that supports high-quality speech synthesis.
Speech Synthesis
E
SWivid
32.58k
48
Tts Ru Free Hf Vits Low Multispeaker
Apache-2.0
A Russian text-to-speech model supporting multiple speakers, capable of directly processing plain text with punctuation without prior conversion to phonemes.
Speech Synthesis Transformers Other
T
utrobinmv
1,021
18
Speecht5 Tts Arabic
MIT
Arabic text-to-speech model fine-tuned on Microsoft's SpeechT5 architecture, trained on the Hakawati dataset
Speech Synthesis Transformers Arabic
S
Reyouf
25
3
Matxa Tts Cat Multispeaker
Apache-2.0
A Catalan multi-speaker text-to-speech model based on Matcha-TTS architecture, trained with optimal transport conditional flow matching for fast and high-quality speech synthesis
Speech Synthesis Other
M
projecte-aina
21
2
Tts Vits Ru Hf
This is a Russian text-to-speech model based on the VITS architecture, capable of converting Russian text into natural speech.
Speech Synthesis Transformers Other
T
joefox
382
14
Vits Vctk
MIT
VITS is an end-to-end speech synthesis model capable of predicting corresponding speech waveforms from input text sequences. The model employs a conditional variational autoencoder (VAE) architecture, including a posterior encoder, decoder, and conditional prior module.
Speech Synthesis Transformers
V
kakao-enterprise
3,601
13
Vits Ljs
MIT
VITS is an end-to-end speech synthesis model capable of predicting corresponding speech waveforms from input text sequences.
Speech Synthesis Transformers
V
kakao-enterprise
1,127
41
Nvidia Tts En Hifitts Hifigan Ft Fastpitch
HiFiGAN is a GAN-based vocoder model capable of generating high-quality audio from mel-spectrograms, supporting multi-speaker English voice synthesis.
Speech Synthesis English
N
Mastering-Python-HF
16
0
Speecht5 Tts Common Voice 5 Sv
MIT
A Swedish text-to-speech model fine-tuned based on Microsoft's SpeechT5 architecture, trained using the Common Voice dataset
Speech Synthesis Transformers Other
S
GreenCounsel
27
1
Kan Bayashi Libritts Xvector Vits
A text-to-speech model trained using the ESPnet framework, trained on the LibriTTS dataset, supporting English speech synthesis.
Speech Synthesis English
K
espnet
61
0
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase